Cooperative Behavior Acquisition for Mobile Robots
نویسندگان
چکیده
In this paper, we rst discuss the meaning of physical embodiment and the complexity of the environment in the context of multiagent learning. We then propose a vision-based reinforcement learning method that acquires cooperative behaviors in a dynamic environment. We use the robot soccer game initiated by RoboCup [12] to illustrate the e ectiveness of our method. Each agent works with other team members to achieve a common goal against opponents. Our method estimates the relationships between a learner's behaviors and those of other agents in the environment through interactions (observations and actions) using a technique from system identi cation. In order to identify the model of each agent, Akaike's Information Criterion is applied to the results of Canonical Variate Analysis to clarify the relationship between the observed data in terms of actions and future observations. Next, reinforcement learning based on the estimated state vectors is performed to obtain the optimal behavior policy. The proposed method is applied to a soccer playing situation. The method successfully models a rolling ball and other moving agents and acquires the learner's behaviors. Computer simulations and real experiments are shown and a discussion is given. ? Partially supported by Japanese Society for Promotion of Science Project \Cooperative Distributed Vision for Dynamic Three Dimensional Scene Understanding." Project ID: JSPS-RFTF96P00501 1 E-mail:[email protected] Preprint submitted to Elsevier Science 6 November 1999
منابع مشابه
Cooperative Control of Mobile Robots in Creating a Runway Platform for Quadrotor Landing
Multi-agent systems are systems in which several agents accomplish a mission in a cooperative manner. In this paper, a novel idea for the construction of a movable runway platform based on multi-agent systems is presented. It is assumed that an aerial agent (quadrotor) decides to make an emergency landing due to reasons such as a decrease in energy level or technical failure, while there is no ...
متن کاملMulti-Robot Target Acquisition using Multiple Objective Behavior Coordination
In this paper we propose an approach to multi-robot coordination in the context of cooperative target acquisition. The approach is based on multiple objective behavior coordination extended to multiple robots. It provides mechanisms for distributed command fusion across a group of robots to pursue multiple goals of multiple robots in parallel. The mechanisms enable each robot to select actions ...
متن کاملCo-evolution for cooperative behavior acquisition in a multiple mobile robot environment
Co-evolution has been receiving increased attention as a method for multi agent simultaneous learning. This paper discusses how multiple robots can emerge cooperative behaviors through co-evolutionary processes. As an example task, a simpli ed soccer game with three learning robots is selected and a genetic programming method is applied to individual population corresponding to each robot so as...
متن کاملThree Examples of Learning Robots
Future robots, especially service and personal robots, will need much more intelligence, robustness and user-friendliness. The ability to learn contributes to these characteristics and is, therefore, becoming more and more important. Three of the numerous varieties of learning are discussed together with results of realworld experiments with three autonomous robots: (1) the acquisition of map k...
متن کاملCooperative Behavior Acquisition by Learning and Evolution of Vision-Motor Mapping for Mobile Robots
This paper proposes a number of learning and evolutionary methods contributed to realize cooperative behaviors among vision-based mobile robots in a dynamically changing environment. There are three difficult problems: partial observation, credit assignment, and synchronized learning. In order to solve these problems, we propose the fundamental model called Local Prediction Model, which can est...
متن کاملCooperative Behavior Acquisition in Multi Mobile Robots Environment by Reinforcement Learning Based on State Vector Estimation
This paper proposes a method that acquires the purposive behaviors based on the estimation of the state vectors. In order to acquire the cooperative behaviors in multi robot environments, each learning robot estimates local predictive model between the learner and the other objects separately. Based on the local predictive models, robots learn the desired behaviors using reinforcement learning....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999